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(Si) Digital scanning technique. 

(57) A novel method and system enabling a com- 
puter to identify the beginning and end of a 
logical group of digitally scanned pages, most 
commonly a multipage document, through the 
use of cover pages or sheets containing 
machine-readable information about where the 
document should be routed over a computer 
network, how to set scanning parameters in the 
scanner, and additional operations to perform, 
greatly increasing the efficiency and simplicity 
of creating, routing and processing scanned 
image documents without operator or user in- 
tervention and thereby enabling increased use 
of scanners in offices and the like. 
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The present invention relates to the digital scan- 
ning of documents coirrprBSing a ptasiifcy or stack of 
sheets or pages containing printed information, being 
more particularly concerned with the automatic iden- 
tification of the beginning and ending of a logical 
group of pages during digital scanning, the identifica- 
tion and routing and storing of the digitally scanned 
sheets in a computer network, and the setting of para- 
meters in the scanner and additional operations to be 
performed, all without the need for operator control or 
intervention. 

BACKGROUND 

Within recent years, computers have automated 
the process of creating documents in offices. Despite 
the explosive growth in such computer use for creat- 
ing documents using word processors, spreadsheets 
or graphics programs, however, studies have shown 
that over 95% of information in offices still remains in 
paper form. In the mid 1980's, the first commercial 
systems were introduced that addressed this prob- 
lem. They allowed paper documents to be digitally 
scanned and turned into a digital image of that docu- 
ment. These documents could then be stored on mag- 
netic disks, sent over computer networks, printed to 
create copies and indexed for future retrieval with a 
database. They are comonly called document imag- 
ing systems, and a summary of the same is presented 
in "Document Imaging Systems: Technology Over- 
view," June 1990, Datapro Research." 

The creation of scanned documents using these 
systems is a laborious process, and several standard 
procedures are in use. One method is to scan a stack 
of sheets or pages and convert them to digitized im- 
ages. This set of image pages can then be viewed on 
a computer display, one at a time, in the same order 
in which they were scanned. By paging through this 
batch, associated pages of a single document can be 
visually identified and electronically "stapled" or as- 
sembled together with appropriate software. Index 
information is often entered at this time into a data- 
base so that the document can later be searched for 
and retrieved. 

A second method involves scanning one docu- 
ment at a time and using an automatic document feed- 
er (ADF) to signal the and of the document through 
an out-of-paper condition, or through manual opera- 
tor data entry at a keyboard, or by action with a point- 
ing device. The start of a document is established by 
initiating the scanning process. Once again, index in- 
formation is often entered into a database at this step. 

A third method involves scanning one page at a 
time and displaying each digitized page on a comput- 
er screen, and enabling the operator to enter notation 
through the keyboard or a pointing device specifying 
the beginning and end of the document. 

Should further processing of the document im- 



age information be required as brothers m the organ- 
ization, and the computer scanning station is connect- 
ed to a computer network, the operator may also 
route this document to another individual on the net- 
5 work for such further processing. This activity may be 
added as the last step to all three above-described 
procedures. 

Underlying the present invention, is the discov- 
ery of a more economical method of creating and 

10 routing the digitally scanned documents by eliminat- 
ing the need for an operator during the scanning and 
routing, such also allowing for more ad hoc use of a 
scanner by all individuals in an office connected to 
the network. Whereas existing document imaging 

15 systems, as previously outlined, are used in applica- 
tions involving repetitive handling of similar types of 
documents, the present invention opens up the use 
of scanners for random use in an office. 

Specifically, the method of the invention involves 

20 the use of a cover page which contains both machine 
and human-readable information, but with machine- 
readable code information containing the network 
address or identifier for the individual to whom the 
documents should be assigned or routed, as well as 

25 settings for the scanner and/or additional operations 
to be performed, such as OCR (optical character rec- 
ognition), and the like— all without the need for oper- 
ator control or intervention. Stacks of paper contain- 
ing printed information (where the term "printed" is in- 

30 tended generically to embrace all of type, handwrit- 
ten, hand drawn, photographic or other information 
placed on the paper or sheet) are placed into the ADF 
at random for scanning. The only requirement is that 
each logical set of pages (normally a document) 

35 needs to be preceded by such a cover page contain- 
ing the particular machine-readable information 
above specified. If it is such a cover page, it automat- 
ically extracts the machine-readable information from 
the digitized page, recognizes it and records the in- 

40 formation. When a new cover page is detected or 
when the scanner ADF is empty, then the previous 
page becomes the last page of the document and all 
pages or the document are electronically "stapled" 
into a single identifiable entity which can be routed as 

45 specified by the machine-readable code and stored. 
When a user at a desktop computer opens the user's 
electronic inbox, each document is accordingly indi- 
vidually identified. Cover pages per se have, of 
course, been used in facsimile (FAX) and prior scan- 
so ning systems and the like, but not in the manner or 
with the particular critical information and response 
thereto of the present invention. 

In United States Patent #5,129,016, issued July 
7, 1992, for example, a cover page is employed, in- 

55 eluding unique information for that particular docu- 
ment and involving the indexing of a document to a 
database. This is for the very different purpose of im- 
age registration for an image filing system. As such, 
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accordingly, the system requires printing a specific 
cover page for each document and assuring such is 
used on the correct one. Such a technique, moreover, 
is prone to error and requires the additional steps of 
printing the cover pages, matching the unique cover 
page to a particular document, and then attaching the 
page. It also requires database information to be en- 
tered prior to the scanning process. 

In many cases, furthermore, scanned documents 
are used for immediate communication, either by 
sending such out via computer FAX in an electronic 
mail system or to convert the document to editable 
text using an OCR process. There is, therefore, in 
such instances, no need to use an index page just for 
indexing the document as in said patent, because it 
will be used for a specific purpose and then deleted. 

The present invention, with its quite different cov- 
er page purposes and operation, on the other hand, 
provides for great simplification of the scanning proc- 
ess for general office use by requiring only a single 
cover page for each user. 

OBJECTS OF INVENTION 

An object of the present invention, accordingly, is 
to provide a new and improved method of and system 
and apparatus for creating and routing digitally scan- 
ned documents that are significantly more economic- 
al and efficient than current techniques and enable 
elimination of the need for an operator during the 
scanning, routing and storing, and provide for in- 
creased potential use of scanners in offices and the 
like. 

A further objective is to provide such an improved 
technique and apparatus through use of a novel cover 
page having machine-readable coded information 
that contains the network address or identifier for the 
document owners to whom the document should be 
assigned or routed, as well as settings for the scanner 
and/or additional operations to be performed, with 
electronic scanner-computer response to all such in- 
formation. 

Still a further object is to provide such a novel 
cover sheet and the scanner-computer supplemental 
facility to read and respond to the machine-readable 
information provided thereon. 

Other and further objects will be described here- 
inafter and are delineated in the appended claims. 

SUMMARY OF INVENTION 



otherwise reproduced, or any other irrfbrmation car- 
ried by the sheet), that comprises, preparing a stack 
of successive sheets for drgitalrzing scanning; pre- 
ceding the top sheet and all other logical sets of 

5 sheets within the stack with a cover sheet bearing 
machine-readable code markings containing identifi- 
cation of the document owner and desired routing of 
the digital information in the computer network; feed- 
ing the cover sheet(s) and stack of sheets into a digi- 

10 tizing scanner; during the scanning, recognizing the 
presence of the cover sheet and thus the beginning 
of the document scanning and reading the machine- 
readable code markings of the cover sheet and re- 
cording the identification-routing information thereof; 

15 upon the identification of a new cover sheet or the ab- 
sence of further sheets fed to the scanner, identifying 
all sheets scanned from the previous cover page as 
a document and thereupon routing the digitized docu- 
ment information scanned therefrom in accordance 

20 with the recorded cover sheet identification-routing 
information; and storing the same in the computer 
network for subsequent retrieval or further process- 
ing, e 

Best mode and preferred designs and em bod i- 

25 ments are hereinafter set forth. 

DRAWINGS 

The invention will now be described with refer- 
30 ence to the accompanying drawings, Fig. 1 of which 
is an isometric view of successive stacks of document 
sheets provided with appropriate cover pages in ac- 
cordance with the invention; 

Figs. 2Aand 2B are schematic diagrams of digital 
35 scanner-ADF-computer interfacing, Fig. 2A em- 

ploying computer-controlled machine-recogniz- 
ing of the cover page code information externally 
of the scanner, and Fig. 2B illustrating such em- 
bedded with the scanner; 
40 Fig. 2 C is a similar diagram illustrating further 

processing such as optical character recognition 
(OCR); 

Figs. 3 A and 3B are flow or operational charts de- 
picting steps in the practice of the invention, with 
45 the starred blocks requiring no user intervention; 

and 

Fig. 4 is a sample cover page. 
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DESCRIPTION OF PREFERRED 
EMBODIMENT(S) OF INVENTION 



In summary, however, from one of its viewpoints, 
the invention embraces a method of automatically 
creating, identifying, routing and storing in a computer 
network, digitally scanned documents comprising a 
stack of sheets containing printed information, 
(where, as before stated, this term as used herein, 
embraces type, handwritten or drawn photographic or 



Turning now to Fig 1, pluralities or stacks of 
sheets or pages, Pg. 1, Pg. 2, Pg.3, etc.; Pg. 1 1 , Pg.2 1 
etc., comprising the documents) of printed informa- 
55 tion to be digitally scanned, routed and stored are 
shown provided with corresponding cover pages 1 , 1 1 
containing the before-described machine-readable 
coded information MRI, as, for example, in preferred 
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bar code format but also in any other suitable format, 
as desired. To accommodate for deferent scanning 
orientation in identifying the cover sheet and reading 
the coded MRI, the latter is preferably placed in op- 
posite-side corners of the cover page, as shown in 5 
Fig. 1. For purposes of identification in assembly for 
the ADF f human-readable information may also be 
provided on the cover pages, as at HRI. 

As earlier explained, in accordance with the pres- 
ent invention, the MRI, such as the bar-coded infor- 10 
mation on the cover page, contains the network ad- 
dress or identifier for the document owner to whom 
the document is assigned or is to be routed and, op- 
timally,settings for the scanner S, Figs. 2A and 2B, 
and/or additional operations, such as OCR and the 15 
like, later more fully explained. The scanner S is sup- 
plementary equipped, either internally or by external 
equipment, with an apparatus MRI code reader R to 
identify during the scanning of the cover page 1, the 
fact that it is a cover page, and by computer program 20 
interfacing, externally at CP in Fig. 2A, or internally as 
embedded in the scanner at CP 1 in Fig. 2B, to re- 
spond by extracting the MRI from the scanned digital 
cover page, recognizing it and recording the informa- 
tion. 25 

When a new cover page 1 1 is encountered, or the 
scanner ADF is empty, the previous page (Pg. n+1) 
becomes the last page of the document and all pages 
(now in digital format) are electron icatly "stapled" or 
assembled into a single identifiable entity under the 30 
control of the programming CP (or CP 1 ) which is then 
routed and stored at CN. The document may then be 
further processed for retrieval, viewing, printing, etc. 
and further routing by the user. 

More specifically, referring to the flow charts of 35 
Figs. 3A and 3B, such method of document separa- 
tion and routing is shown commencing with the step 
of preparing or separating of the documents (s) with 
cover sheets in the top step block 2 of Fig. 3A. After 
ascertaining if the scanner is quiescent at 4 and plac- 40 
ing the stacks at 6, Fig. 1, with appropriate cover 
sheets 1, 1 1 on the scanner feeder ADF (and, if the 
scanner is not stopped, stopping the same at 8, re- 
moving existing documents on the ADF at 10, and ap- 
pending new documents with cover sheets and stack- 45 
ing on the ADF at 12), the scanner S is started at 14 
by keyboard entry or the use of a pointing device or 
mouse. If everything is in order at scanner condition 
check 1 6 and the digital data image data stream is col- 
lected and stored at 18, the digital data on the scan- so 
ned cover page 1 is analyzed and the MRI read and 
recognized at 20, (box A, Fig. 3B), being then record- 
ed at 22; i.e. the routing and scanner configuration in- 
terfacing, etc. parameters, including resolution, 
brightness, contrast, mode (color, grayscale or bito- 55 
nal), dither, page size, scan region and the like. 

Referring to a sample cover page presented in 
Fig. 4, the bar code is shown followed by user- 



selectable boxes the marking of which indicates to 
the MRI reader, scan parameter settings ("FAX — re- 
solution setting; *PrCT--dither setting; and "COLOR"- 
-mode setting), and whether there is to be further 
OCR (or other) processing. 

Such an OCR operation is shown in Fig. 2C 
where, if the MRI contains an OCR processing re- 
quest ("OCR" box checked in Fig. 4), the computer 
CP routes the documents to the OCR server O, 
where the digitized image data stream is converted to 
textural information and routed to network CN. 

As more particularly detailed in the flow diagram 
of Fig. 3B, the reading of MRI-scanner parameter set- 
ting information at 22 provides appropriate setting 
commands at 23 to the scanner S ("commands" in 
Figs. 2Aand 2C) or through internal setting (Fig. 2B), 
which is the before-mentioned resolution, dither, 
mode, etc. 

Should more page data be required at 1 8, such is 
accommodated and the check at 16 repeated. Should 
no cover page be indicated at 20, however, the digi- 
tally scanned page information will be stored at 15 
and electronically associated with the document at 17 
and then applied in the next page scanning at 30, Fig. 
3B. 

Upon the conclusion of the scanning of the docu- 
ment (Pg. n+1, Fig. 1) at 24, and the advent of the 
next cover page 1 1 (or empty ADF), the document, ap- 
propriately routed is stored in the network at 26, and, 
if desired, the cover sheet data may be discarded, as 
at 28. The next page scanning commences at 30 as 
the next document (1 1 ), Pg. 1 1 , Pg. 2 1 , Fig. 1) is au- 
tomatically fed at the ADF. The operation then returns 
at D to step 16, Fig. 3A. 

If, however, during the scanner condition check at 
16, there is no longer a sheet or page in the ADF, an 
end-of-document signal is generated at 3. If there had 
been a cover sheet with its MRI information in the pre- 
ceding scanner operation at 5, indicating, for exam- 
ple, that a proper document such as in Fig.1 had been 
digitized by the scanner, the digitized pages are col- 
lected or "stapled" and stored as the digitized docu- 
ment at 7 and at the cover- page-directed routed ad- 
dress or owner's identification. The scanner is then 
stopped at C and 9, Fig. 3B. 

On the other hand, if there was no scanned cover 
sheet at 5, Fig. 3A ( the document is discarded (B and 
1 1 , Fig. 3B) and the scanner stopped at 9 and/or error 
notification is sent to the system administrator over 
the network. 

For purposes of visualization of the operation of 
the scanner and that the parameters are correct, a 
display monitor DS may be provided as shown in 
Figs.2A, B and C. 

Returning to Fig. 3A, if, during the scanner con- 
dition check at 1 6 there had been a paper jam or error, 
such would be indicated for correction at 13, and then 
a new start initiated at 14. 
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A prototype of the invention has been successful- 
ly operated with a Hewlett-Packard Scanjet Hp scan- 
ner, with an ADF connected to an IBM PC/ AT compat- 
ible computer which controlled the scanner operation 
and performed the recognition of the machine-read- 
able code with an application running under Microsoft 
Windows. This machine was connected to a Novell 
NetWare 386 network over which the documents 
were routed. For the prototype, Code 39 Barcode was 
used at 36 points for the machine-readable code. It 
was placed starting at 1 1/4 inch from the top and left 
edges of the paper. In addition, human-readable in- 
formation was placed in the center of the cover page 
so that the user can easily identify the owner of the 
cover pages, providing individual names as well as in- 
formation and inventories about specific scanning 
parameters. 

Some of the important features of the invention 
can be accomplished by attaching or adding to the 
first document page or sheet, the MRI coded informa- 
tion; or similarly to the end of the last sheet, or on a 
backing sheet. 

Further modifications will also occur to those skil- 
led in this art and are considered to fall within the spir- 
it and scope of the invention as defined in the ap- 
pended claims. 



Claims 



2. A method as claimed in claim 1 and in which the 
machine-readable code markings contain also 
scanner parameter information adapted for the 
nature of the information on the sheets; and re- 

5 sponding to the reading thereof appropriately to 

set the scanner parameters during the digitizing 
scanning. 

3. A method as claimed in claim 2 and in which the 
10 machine-readable code markings additionally 

contain further operational instructions including 
for optical recognition. 

4. Amethod as claimed in claim 2 and in which in the 
15 event that the presence of a cover sheet is not 

recognized during the scanning, the document is 
discarded and/or an error message is sent. 

5. Amethod as claimed in claim 1 and in which the 
20 cover sheet is also provided with human-read- 
able identification information. 

6. Amethod as claimed in claim 1 and in which the 
machine-readable code markings are in bar code 

25 format. 

7. For use with a digitizing scanner, a method of au- 
tomatically creating, identifying, routing and stor- 
ing in a computer network documents comprising 
a stack of sheets containing information- to-be- 
scanned and digitized, that comprises, preceding 
the top sheet and all other logical sets of sheets 
within the stack with a cover sheet bearing ma- 
chine-readable code markings containing identi- 
fication of the document owner and desired rout- 
ing of the digitized information in the computer 
network, and scanner parameter information; 
feeding the cover sheet and stack of sheets of the 
document into said digitizing scanner; recogniz- 
ing the presence of the cover sheet at the begin- 
ning of the scanning and reading its machine- 
readable code markings; recording the identifica- 
tion-routing information read from the cover 
sheet code markings; responding to the cover 
sheet code markings containing said scanner 
parameter information to set the scanner with 
those parameters; upon the identification of a 
new cover sheet or the absence of further sheets 
fed to the scanner, identifying all sheets scanned 
from the previous cover sheet as a document, 
and thereupon routing the scanned and digitized 
document information in accordance with the re- 
corded cover sheet identification-routing infor- 
mation; and storing the same in the computer 
network. 

8. A method as claimed in claim 7 and in which the 
recognizing recording, responding and routing 



1. A method of automatically creating, identifying, 
sorting and storing in a computer network, digi- 
tally scanned documents comprising a stack of 
sheets containing information thereupon, that 
comprises, preparing a stack of successive 35 
sheets for digitalizing scanning; preceding the 
top sheet and all other logical sets of sheets with- 
in the stack with a cover sheet bearing machine- 
readable code markings containing identification 
of the document owner and desired routing of the 40 
digital information in the computer network; feed- 
ing the cover sheet(s) and the stack of sheets into 
a digitizing scanner; during the scanning, recog- 
nizing the presence of the cover sheet and thus 
the beginning of the document scanning, and 45 
reading the machine-readable code markings of 
the cover sheet and recording the identification- 
routing information thereof; upon the identifica- 
tion of a new cover sheet or the absence of fur- 
ther sheets fed to the scanner, identifying all so 
sheets scanned from the previous cover sheet as 
a document and thereupon routing the digitized 
document information scanned therefrom in ac- 
cordance with the recorded cover sheet identifi- 
cation-routing information; and storing the same 55 
in the computer network for subsequent retrieval 
or further processing. 



BNSDOCID: <EP 0596724A1_L> 




steps are performed by Gompufcar processing in- 
terfacing with the scanner. 

9. A method as claimed in claim 8 and in which the 
computer processing is effected externally of the 5 
scanner between the scanner and the computer 
network. 

10. A method as claimed in claim 8 and in which the 
computer processing is embedded within the 10 
scanner. 

11. A method as claimed in claim 8 and in which the 
machine-readable code markings are in bar-code 
format, and the scanner is provided with bar-co- 15 
de reading and recognizing capability. 

12. In a digitizing scanner system for documents 
comprising a stack of sheets containing printed 
information to-be-digitized, apparatus for auto- 20 
matically creating, identifying, routing and storing 
such information in a computer network in digi- 
tized form, having, in combination with a digitiz- 
ing scanner, means for feeding to the scanner the 
successive logical sets of sheets of the stack 25 
each preceded by a cover sheet bearing ma- 
chine-readable code markings containing identi- 
fication of the document owner and desired rout- 
ing of the digitized information in the computer 
network; means connected with the scanner for 30 
recognizing the presence of the cover sheet at 

the beginning of the scanning and reading its ma- 
chine-readable code markings; and means oper- 
ative upon identification of a new cover sheet or 
the absence of further sheets at the scanner, for 35 
identifying all sheets scanned from the previous 
cover as a document; means responsive to the 
last-named means for thereupon collecting and 
routing the scanned end digitized document in- 
formation in accordance with the recorded cover 40 
sheet identification-routing information; and 
means for storing the same in the computer net- 
work. 

13. Apparatus as claimed in claim 12 and in which the 45 
cover sheet is provided with further machine- 
readable code markings containing scanner 
parameter setting information; and the scanner is 
provided with means for reading and responding 

to the further code markings to set the scanner so 
with those parameters for the scanning of the 
sheets of the stack. 

14. Apparatus as claimed in claim 13 and in which the 
further machine-readable code markings also in- 55 
elude other operational instructions including 
optical recognition functions. 



15. Apparatus as claimed in claim 12 and in which the 
recognizing, recording, indicating, collecting and 
routing means are controlled by computer proc- 
essing means interfacing with the scanner. 

1 6. Apparatus as claimed in claim 1 5 and in which the 
computer processing means is disposed extern- 
ally of the scanner between the same and the 
computer network. 

17. Apparatus as claimed in claim 15 and in which the 
computer processing means is embedded within 
the scanner. 

1 8. Apparatus as claimed in claim 1 3 and in which the 
machine-readable code markings are in bar-code 
format and the scanner is provided with bar-code 
reading and recognizing means. 

19. For use with a document-digitizing scanner con- 
nected with a computer network, a document 
cover sheet bearing machine-readable code 
markings containing identification of the docu- 
ment owner and desired routing of the digitized 
document in the computer network for recogni- 
tion and reading by and routing from the scanner. 

20. A document cover sheet as claimed in claim 19 
and in which the machine-readable markings fur- 
ther contain scanner parameter setting informa- 
tion for response thereto by the scanner as it 
scans the document. 

21. A document cover sheet as claimed in claimed in 
claim 19 and in which the machine-readable code 
markings are displayed on opposite-side corners 
of the cover sheet. 
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